Second Law of Thermodynamics with Discrete Quantum Feedback Control 
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A new thermodynamic inequality is derived which leads to the maximum work that can be 
extracted from multi-heat baths with the assistance of discrete quantum feedback control. The 
maximum work is determined by the free-energy difference and a generalized mutual information 
content between the thermodynamic system and the feedback controller. This maximum work can 
exceed that in conventional thermodynamics and, in the case of a heat cycle with two heat baths, 
the heat efficiency can be greater than that of the Carnot cycle. The consistency of our results 
with the second law of thermodynamics is ensured by the fact that work is needed for information 
processing of the feedback controller. 
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Among a large number of studies conducted on the 
relationship between thermodynamics and information 
processing 0, |, i |, 1, S 0, g @, M EL H, M M , 
particularly provoking is the work by Szilard |2|] who ar- 
gued that positive work W ox t can be extracted from an 
isothermal cycle if Maxwell's demon plays the role of a 
feedback controller 0]. It is now well understood that 
the role of the demon does not contradict the second law 
of thermodynamics, because the initialization of the de- 
mon's memory entails heat dissipation 0, 0, 0] . We note 
that, in the case of an isothermal process, the second law 
of thermodynamics can be expressed as 



W ext < —AF , 



(1) 



where AF S is the difference in the Helmholtz free energy 
between the initial and final thermodynamic equilibrium 
states. 

In a different context, quantum feedback control has 
attracted considerable attention for controlling and sta- 
bilizing a quantum system [3, [13, El H HE EH HI . It 
can be applied, for example, to squeezing an electromag- 
netic field [l8| , spin squeezing [2(| , and stabilizing macro- 
scopic coherence 22j. While the theoretical framework 



of quantum feedback control as a stochastic dynamic sys- 
tem is well developed, the possible thermodynamic gain 
of quantum feedback control has yet to be fully under- 
stood. 

In this Letter, we derive a new thermodynamic inequal- 
ity which sets the fundamental limit on the work that can 
be extracted from multi-heat baths with discrete quan- 
tum feedback control @, H3] , consisting of quantum mea- 



surement 



24, 2 



and a mechanical operation depending 
on the measurement outcome. The maximum work is 
characterized by a generalized mutual information con- 
tent between the thermodynamic system and the feed- 
back controller. We shall refer to this as the QC-mutual 
information content, where QC indicates that the mea- 



sured system is quantal and that the measurement out- 
come is classical. The QC-mutual information content 
reduces to the classical mutual information content [23| 
in the case of classical measurement. In the absence of 
feedback control, the new inequality (fT2|) reduces to the 
Clausius inequality. In the case of an isothermal pro- 
cess, its upper bound exceeds that of inequality ([!]) by 
an amount proportional to the QC-mutual information 
content. 

We consider a thermodynamic process for system S 
which can contact heat baths B 1; B 2 , • • • , B„ at respec- 
tive temperatures Ti, T 2 , • • • , T n . We assume that system 
S is in thermodynamic equilibrium in the initial and fi- 
nal states. For simplicity, we also assume that the initial 
and final temperature of S is given by T = (k-B/3)^ 1 . This 
can be realized by contacting S with, for example, Bi in 
the preparation of the initial state and during equilibra- 
tion to the final state; in this case T = T\. We do not, 
however, assume that the system is in thermodynamic 
equilibrium between the initial and final states. 

We assume that system S and heat baths B m are as 
a whole isolated and that they only come into contact 
with some external mechanical systems and the feedback 
controller. Apart from the feedback controller, the total 
Hamiltonian can be written as 



H(t) = H s {t) + ^ {H SBm (t) + H l 



(2) 



where H SBm (t) is the interaction Hamiltonian between 
system S and heat bath B m . The Hamiltonian H s (t) 
describes a mechanical operation on S through such ex- 
ternal parameters as an applied magnetic field or volume 
of the gas, and the Hamiltonian H SB ™ (t) describes, for 
example, the attachment (detachment) of an adiabatic 
wall or B m to (from) S. We consider a time evolution 
from U to t { , assume H SBm (ti) = H SBm (t { ) = for all 
m, and write H s (ti) = flf and H s (t f ) = Hf. The time 



2 



evolution of the total system with discrete quantum feed- 
back control can be divided into the following five stages: 
Stage 1 (Initial state) At time tj, the initial state of 
S and that of B m are in thermodynamic equilibrium at 
temperatures T and T m , respectively. We assume that 
the density operator of the entire state is given by the 
canonical distribution 



Pi 



exp(-06?) exp(-/3 lJ ff Bl ) 



Zf 



Z B„ 



(3) 

where (3 m = (feeT™)" 1 (m = l,2,---,n), Zf = 
tr{exp(-/?fff)}, and Z B ™ = tr{exp(-/3 m ff B '" )}. We 
denote the Helmholtz free energy of system S as F ; s = 
-k B T\nZf. 

Stage 2 (Unitary evolution) From t\ to t±, the 
entire system undergoes unitary evolution Ui = 
Texp(f£ H(t)dt/ihy 

Stage 3 (Measurement) From t\ to t^, the feed- 
back controller performs quantum measurement on S 
described by measurement operators {Mk} and obtains 
each outcome k with probability pk- Let X be the set 
of outcomes fc's, and {£>k} be POVM as defined by 
Dk = Mf,Mu\ we then havepfc = ti(Dkp). We denote the 
pre-measurement density operator of the entire system as 
pi , the post-measurement density operator with outcome 

k as fa = MkpMl/pk, and define p 2 = Y^,kPkP2 ■ Notc 
that our scheme can be applied not only to a quantum 
measurement, but also to a classical measurement which 
can be described by setting [pi,Dk] — for all k. 

Stage 4 (Feedback control) From t 2 to £3, the feedback 
controller performs a mechanical operation on S depend- 
ing on outcome k. Let Uk be the corresponding uni- 



tary operator on the entire system, and p^ = Ukp^'Uj 
be the density operator of the entire system at £3 corre- 
sponding to outcome k. We define /5 3 = Y^kPkP^- Note 
that the feedback control is characterized by {Mk} and 

Stage 5 (Equilibration and final state) From £3 to t{, 
the entire system evolves according to unitary operator 
Uf which is independent of outcome k. We assume that 
by if system S and heat bath B m will have reached ther- 
modynamic equilibrium at temperatures T and T m , re- 
spectively. We denote as pt the density operator of the 
final state of the entire system, which is related to the 
initial state as 



Pi 



= £(pi) = J2 UfUkMkUmUi MMul (4) 



We emphasize that pt need not equal the rigorous canon- 
ical distribution Pf an , as given by 



Pi 



eM-PHf) eM~PiH Bl ) ^ exp(-0 n H l 



Zf 



where Zf = tr{exp(— (3Hf)}. We only assume that the fi- 
nal state is in thermodynamic equilibrium from a macro- 
scopic point of view [131 ]. 

We will proceed to our main analysis. The difference 
in the von Neumann entropy between the initial and fi- 
nal states can be bounded from the foregoing analysis as 
follows: 

s(pO - s(p t ) 

=S(pi) - S(p 3 ) 

<S(ft)-J>* S (/i3 fc) ) 



=S(pi 



--S(pi) 



where S(p) = — tr(plnp) is the von Neumann entropy 
and H({pk}) = — X^feeJC P k m Pfc ^ s the Shannon infor- 
mation content. Note that in deriving the inequality ([6]), 
we used the convexity of the von Neumann entropy 




i-e. S(£ k p k $>) > E k PkS(pD- 
H(P!,X) ^-^trCv^piV^ln 

I(p 1 :X) = S(p 1 )+H({p k }) 
we obtain 



Defining notations 
s/lhfay/Dk) and 
-H(px,X), (7) 



(8) 



We refer to I(pi : X) as the QC-mutual information 
content which describes the information about the mea- 
sured system that has been obtained by measurement. 
As shown later, I(pi :X) satisfies 



0<I(p 1 :X)<H({p k }). 



(9) 



(5) 



We note that I(p\ : X) = holds for all state p\ 
if and only if Dk is proportional to the identity op- 
erator for all k, which means that we cannot obtain 
any information about the system by this measure- 
ment. On the other hand, I(p\ : X) — H({pk}) holds 
if and only if Dk is the projection operator satisfying 
[pi,Dk] = for all k, which means that the measure- 
ment on state pi is classical and error-free. In the case 
of classical measurement (i.e. [pi,L>k] = for all fc), 
I(pi '■ X) reduces to the classical mutual information 
content. In fact, we can write I(pi : X) in this case as 
I(p! : X) = -X«*hig 4 - Efc /J *p(fc|*)lnp(fc|«) I where 
Pi = J^i lil^i) H'il is the spectrum decomposition of the 
measured state, and p(k\i) = (ipi\Dk\ijji) can be inter- 
preted as the conditional probability of obtaining out- 
come k under the condition that the measured state is 
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I(pi : X) can be written as I{p\ : X) — xiiP^}) ~ 

Apneas, where x({pi }) = S (fc) ~ J2kex PkS(p^)Js 
the Holevo x quantity which sets the Holevo bound [24j, 
261 ] . and ASmeas = S(p2) — S(p\) is the difference in the 
von Neumann entropy between the pre-measurement and 
post-measurement states. If AS mcas = holds, that is, if 
the measurement process does not disturb the measured 
system, then I(f>\ :X) reduces to the Holevo x quantity; 
in this case, the upper bound of the entropy reduction 
with discrete quantum feedback control is given by the 
distinguishability of post-measurement states {pg }• 

Nielsen et al. have derived inequality S(pi) — S(pf) < 
S(p u £) 0,111, where S(p u £) is the entropy exchange 
which depends on entire process £, including the feed- 
back process. In contrast, our inequality (|8|) is bounded 
by I(pi :X) which does not depend on the feedback pro- 
cess, but only depends on pre-measurement state p\ and 
POVM {£>k}, namely, on the information gain by the 
measurement alone. 

It follows from inequality ([8]) and Klein's inequality [27] ] 
that 



S(pi) < -tr(p f lnpf n )+J(pi:X). 



(10) 



Substituting Eqs. §S§ and (O into inequality (fTD|) . we 
have 

(Ef -Ef)+J2jr- ~ £? m ) < " Ff+ kBTIfr : X) , 

-i m 
m—1 

(ii) 

where Ef = tr(iZfpi), Ef = tr(if f s p f ), Ef m = 
tr{H Bm pi), and Ef m = tr(iT Bm p f ). Defining the differ- 
ence in the internal energy between the initial and final 
states of system S as AU S = Ef — Ef, the heat ex- 
change between system S and heat bath B m as Q m = 
Ef m — E Bm , and the difference in the Helmholtz free 
energy of system S as AF S = F f s — F-f, we obtain 

n rp 

-AU S +J2 Tp-Qm < -AF S + kaTIfaiX). (12) 

m—1 

This is the main result of this Letter. Inequality (fT2|) rep- 
resents the second law of thermodynamics in the presence 
of a discrete quantum feedback control, where the effect 
of the feedback control is described by the last term. For 
a thermodynamic heat cycle in which I(p% : X) = 0, 
AU S = 0, and AF s = hold, inequality §T2$ reduces to 
the Clausius inequality 



(13) 



The equality in (fT2|) holds if and only if p^ is indepen- 
dent of measurement outcome k (i.e. the feedback control 
is perfect), and pi coincides with pf** 1 . 



We will discuss two important cases for inequality. Let 
us first consider a situation in which the system under- 
goes an isothermal process in contact with single heat 
bath B at temperature T. In this case, (fT2")) reduces to 



W CKt < -AF s + k B TI(p 1 :X), 



(14) 



where the first law of thermodynamics, W e *t — 
12m=i Q" 1 ~ AU S , is used. Inequality (fT4|) implies that 
we can extract work greater than —AF S from a single 
heat bath with feedback control, but that we cannot ex- 
tract work larger than —AF s +kBTI(pi :X). If we do not 
get any information, JT3]) reduces to (JTJ) . On the other 
hand, in the case of classical and error-free measurement, 
becomes W ext < ~AF S + k B TH({ Pk }). 
The upper bound of inequality (|14[) can be achieved 
with the Szilard engine [2] which is described as fol- 
lows. A molecule is initially in thermal equilibrium in 
a box in contact with a heat bath at temperature T. We 
quasi-statically partition the box into two smaller boxes 
of equal volume, and perform a measurement on the sys- 
tem to find out in which box the molecule is. When the 
molecule is found in the right one, we remove the left 
one and move the right one to the left position, which 
is the feedback control. We then expand the box quasi- 
statically and isothermally so that the final state of the 
entire system returns to the initial state from a macro- 
scopic point of view. During the entire process, we obtain 
In 2 of information and extract fee Tin 2 of work from the 
system. 

We next consider a heat cycle which contacts two heat 
baths: Bh at temperature Tjj and Bl at TL with Th > 
T L . We assume that Hf = Hf, AU S = 0, and AF S = 0. 
Noting that W cx t = Qh + Ql , we can obtain 

W cxt < (l - ^ Q H + kamipi :X). (15) 



Without a feedback control, (|15|) shows that the upper 
bound for the efficiency of heat cycles is given by that 
of the Carnot cycle: W^/Qh < 1 - T L /T H . With 
feedback control, (fl"5j) implies that the upper bound 
for the efficiency of heat cycles becomes larger than 
that of the Carnot cycle. The upper bound of |(T5j) 
can be achieved by performing a Szilard-type opera- 
tion during the isothermal process of the one-molecule 
Carnot cycle; if we perform the measurement and feed- 
back with In 2 of information in the same scheme as the 
Szilard engine during the isothermal process at temper- 
ature Th, the work that can be extracted is given by 
= (1 - 7l/Th)(Qh - fc B T H ln2) + fc B T H ln2 = 
(1 - T l /T h )<3h + fcBT L ln2. Note that we can reach 
the same bound by performing the Szilard-type opera- 
tion during the isothermal process at temperature Tl. 

We now prove inequality (|9|). For simplicity of no- 
tation, we consider a quantum system denoted as Q in 
general, instead of S and B m 's. The measured state 
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of system Q is written as p, and POVM as {L>k}kex- 
We introduce auxiliary system R which is spanned by 
orthonormal basis {\(pk)}kex, and define two states b\ 
and o- 2 of Q + R as (Ti = J2k VP D kVp ® \(f>k){4>k\ and 
^2 = J2k ^f~^kP\f~D~k ® \4>k)(4>k\- It can be shown 
that tr(Vp-DfcVp) = tr(\/5^V'Sk) = Pk, tr^oi) = 
p, and tr Q (<7i) = Y.kPkl^k) (4>k\ = Pr- Defining 



y/pDk^/p/pk, a. 



(k) 



J2kPk&2 k \ we have 



DklPk and /S' 



S(a 2 ) = J2PkS[vDkP\JD k ® |0fc)(0fc|/pfc ) +^({Pfc}) 



,(fe), 



^({p fe }) = #( / 3,X). 



(16) 



Since S{L<L) = S(LU) holds for any linear opera- 
tor 2, we have S , (a 2 ) = Y,kPk S (^) + H({p k }) = 
E k PkS(cr[ k) )+H({ Pk }) = Sib!). Therefore 

H{p, X) = S{b{) < S(p) + S(M) = S(p) + H({p k }), 

(17) 

which implies I(p:X) > 0. The equality in (fT7|) holds for 
all p if and only if <ti can be written as tensor prod- 
uct p <g> /5r for all p: that is, D k is proportional to 
the identity operator for all k. We will next show that 
I{p ■ X) < H({p k }). We make spectral decompositions 
as P = E^IV^K^I and p' = V. r^'AM], where 



= E 4 * d 



and define dy = 



j 3 ■ j\r 3 i 

EM 

where Ej = 1 f° r all j and Y]j djj = 1 for all i. 
It follows from the convexity of —a; In a; that S(p) = 
- J2i 1i m Qi < - J2j r j lnr i = S(p'). Therefore, 



H({p k })-I(p:X)=H(p,X)- 
= H({ Pk }) 



S( P ) 



>ff({ Pfc })+Ep fe 5((7 : 



2 J 



> 0. 



S(P) 
S(p') 

(18) 

It can be shown that the left-hand side is equal to zero for 
all p if and only if D k is a projection operator satisfying 
[p, D k ] = for all k. 

Our results do not contradict the second law of ther- 
modynamics, because there exists an energy cost for in- 
formation processing of the feedback controller 0, 0, 0] • 
Our results are independent of the state of the feedback 
controller, be it in thermodynamic equilibrium or not, 
because the feedback control is solely characterized by 
{M k } and {U k }. 

In conclusion, we have extended the second law of ther- 
modynamics to a situation in which a general thermody- 
namic process is accompanied by discrete quantum feed- 
back control. We have applied our main result (TT2]) to an 



isothermal process and a heat cycle with two heat baths, 
and respectively obtained inequalities (fl~4| and (fl~5]) . We 
have identified the maximum work that can be extracted 
from a heat bath(s) with feedback control; the maximum 
work is characterized by the generalized mutual infor- 
mation content between the measured system and the 
feedback controller. 
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